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Abstract. We present a new family of graphical models that may have undirected, directed 
and bidirected edges. We name these new models marginal AMP (MAMP) chain graphs 
because each of them can be seen as the result of marginalizing out some nodes in an AMP 
chain graph. However, MAMP chain graphs do not only subsume AMP chain graphs but 
also regression chain graphs. We describe global and local Markov properties for MAMP 
chain graphs and prove their equivalence for compositional graphoids. We also characterize 
when two MAMP chain graphs are Markov equivalent. 



1. Introduction 

Chain graphs (CGs) are graphs with possibly directed and undirected edges, and no semidi- 
rected cycle. They have been extensively studied as a formalism to represent independence 
models. CGs extend Bayesian networks, i.e. directed graphs with no directed cycle, and 
Markov networks, i.e. undirected graphs. Therefore, they can model symmetric and asym- 
metric relationships between the random variables of interest, which is one of the reasons 
of their popularity. However, unlike Bayesian networks and Markov networks whose inter- 
pretation is unique, there are four diffe r ent interpretations of CGs as i ndependence models 
fICox and WermuthI Il993l . Il996l : lOrtonl . l2009l: ISonntag and Pend , l2013|). In this pa per, we 
are interested in the AMP interpretation f Andersson et al. .120011: Levitz et al., '2OOI') and in 
the multivariate regression (MVR) interpretation f Cox and Wermuth . 19931 . .1996). Although 
MVR CGs were originally represented using dashed directed and undirected edges, we prefer 
to represent them using solid directed and bidirected edges. 

In this paper, we combine the AMP and MVR interpretations of CGs. Specifically, we 
introduce a new family of graphical models that may have undirected, directed and bidirected 
edges. We call this new family marginal AMP (MAMP) CGs. We formally define MAMP CGs 
in Section [31 In that section, we also describe global and local Markov properties for MAMP 
CGs and prove their equivalence for compositional graphoids. In Section HJ we characterize 
when two MAMP CGs are Markov equivalent. Finally, we discuss in Section [5] how MAMP 
CGs relate to other existing graphical models such as regression CGs, maximal ancestral 
graphs, summary graphs and MC graphs. 



2. Preliminaries 

In this section, we review some concepts from graphical models that are used later in this 
paper. All the graphs and probability distributions in this paper are defined over a finite set 
V. All the graphs in this paper are simple, i.e. they contain at most one edge between any 
pair of nodes. The elements of V are not distinguished from singletons. The operators set 
union and set difference are given equal precedence in the expressions. 

If a graph G contains an undirected, directed or bidirected edge between two nodes Vi and 
V2, then we write that Vi - V2, Vi ^ V2 or Vi ^ V2 is in G. We represent with a circle, such 
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as in o->- or q-q, that the end of an edge is unspecified, i.e. it may be an arrow tip or nothing. 
The parents of a set of nodes X of G is the set padX) = V2 is in G, Vi i X and 

V2 e X}. The children of X is the set chciX) = {Vi\Vi ^ V2 is in G,ViiX and V2 e X}. The 
neighbors of X is the set nedX) = {Vi|Vi - V2 is in G, Vi i X and V2 e X}. The spouses of 
X is the set spg{X) - {Vi|Vi ■«-^ V2 is in G, Vi ^ X and V2 e X}. The adjacents of X is the set 
adaiX) = neciX) upaG(X)uchG(X)uspG(X). A route between a node Vi and a node in 
G is a sequence of (not necessarily distinct) nodes Vi, . . . , 14 st € adcC^+i) for all 1 < z < 
If the nodes in the route are all distinct, then the route is called a path. The length of a route 
is the number of (not necessarily distinct) edges in the route, e.g. the length of the route 
Vi , ■ • ■ , is n - 1. A route is called descending if Vi+i or Vi- Vi+i is in G for all 1 < i < n. 
A route is called strictly descending if \^ V^+i is in G for all 1 < i < n. The descendants of a 
set of nodes X of G is the set dedX) = {V^| there is a descending route from Vi to Vn in G, 
Vi e X and Vn ^ X}. The non-descendants of X is the set ndeaiX) -V\ deai^X). The strict 
ascendants of X is the set sandX) = {Vi\ there is a strictly descending route from Vi to Vn 
in G,Vi^X and Vn e X}. A route Vi, . . . , in G is called a cycle if K. = Vi. Moreover, it 
is called a semidirected cycle if Vi ^ V2 is in G and Vi V^+i, Vi <^ Vi+i or - Vi+i is in G 
for all 1 < 2 < n. An AMP chain graph (AMP CG) is a graph whose every edge is directed 
or undirected st it has no semidirected cycles. A MVR chain graph (MVR CG) is a graph 
whose every edge is directed or bidirected st it has no semidirected cycles. A set of nodes of 
a graph is connected if there exists a path in the graph between every pair of nodes in the 
set st all the edges in the path are undirected or bidirected. A connectivity component of a 
graph is a connected set that is maximal (wrt set inclusion). The subgraph of G induced by 
a set of its nodes X, denoted as Gx, is the graph over X that has all and only the edges in 
G whose both ends are in X. 

Let X, Y, Z and W denote four disjoint subsets of V. An independence model M is a set of 
statements X 1 mYIZ. Moreover, M is called graphoid if it satisfies the following properties: 
Symmetry X ]_mY\Z Y ]_mX\Z, decomposition X ]_mY lj W\Z ^ X ]_mY\Zi weak union 
XLmY^ W\Z => XLmY\Z u W, contraction XLmY\Z u W aX1mW\Z ^XImYu W\Z, 
and intersection X 1 pY\Z uW aX 1 pW\Z dY ^ X 1 pY l^ W\Z. Moreover, M is called 
compositional graphoid if it is a graphoid that also satisfies the composition property X 1 
mY\Z aXLmW\Z ^ XLmYuW\Z. 

We now recall the semantics of AMP CGs and MVR CGs. A node S in a path p in an 
AMP CG G is called a triplex node in p \i A ^ B C , A ^ B -C , or A- B <^ C is subpath 
of p. Moreover, p is said to be Z-open with Z ^V when 

• every triplex node in p is in Z u sanoiZ), and 

• every non-triplex node B in p is outside Z, unless A - B - C is a subpath of p and 
paoiB) \ Z # 0. 

Let X, Y and Z denote three disjoint subsets of V. When there is no Z-open path in G 
between a node in X and a node in Y, we say that X is separated from Y given Z in G 
and denote it as X ]_gY\Z. The independence model induced by G is the set of separation 
statements X 1g Y\Z. 

A node i? in a path p in a MVR CG G is called a triplex node in p if o->- S -^-q G is a 
subpath of p. Moreover, p is said to be Z-open with Z ^V when 

• every triplex node in p is in Z u sanaiZ), and 

• every non-triplex node B in p is outside Z. 

Let X, Y and Z denote three disjoint subsets of V. When there is no Z-open path in G 

between a node in X and a node in Y, we say that X is separated from Y given Z in G 
and denote it as X 1gY\Z. The independence model induced by G is the set of separation 
statements X Iq Y\Z. 



3 



3. Definition 

A graph G containing possibly directed, bidirected and undirected edges is a marginal 
AMP (MAMP) CG if 

CI. G has no semidirected cycle, 

C2. G has no cycle Vl, . . . , = Vi st V2 is in G and Vi - Vi+i is in G for all 1 < z < n, 
and 

C3. if V1-V2- V3 is in G and spg{V2) + 0, then Vi - V3 is in G too. 

A set of nodes of a MAMP CG G is undirectly connected if there exists a path in G 
between every pair of nodes in the set st all the edges in the path are undirected. An 
undirected connectivity component of G is an undirectly connected set that is maximal (wrt 
set inclusion). We denote by ucg{A) the undirected connectivity component a node A of G 
belongs to. 

The semantics of MAMP CGs is as follows. A node S in a path p in a MAMP CG G is 
called a triplex node in p ii A ^ B C , A ^ B - G , 01 A - B C is a, subpath of p. 
Moreover, p is said to be Z-open with Z when 

• every triplex node in p is in Z u sandZ), and 

• every non-triplex node 5 in p is outside Z, unless A - B - G is a subpath of p and 
spg(B) or paciB) \Z + 0. 

Let X, Y and Z denote three disjoint subsets of V . When there is no Z-open path in G 
between a node in X and a node in y, we say that X is separated from Y given Z in G and 
denote it as X ]_gY\Z. We denote by X IgY\Z that X ]_gY\Z does not hold. Likewise, we 
denote by X LpY\Z (respectively X j_p Y\Z) that X is independent (respectively dependent) 
of Y given Z in a probability distribution p. The independence model induced by G, denoted 
as I{G), is the set of separation statements X 1g Y\Z. We say that p is Markovian wrt G 
when X lpY\Z if X 1gY\Z for all X, Y and Z disjoint subsets of V. Moreover, we say that 
p is faithful to G when X lpY\Z iff X 1gY\Z for all X, Y and Z disjoint subsets of V. 

Note that if a MAMP CG G has a path Vi - - • ■ • - K st spG^Vi) ^ for all 1 < z < n, 
then Vi -Vn must be in G. Therefore, the independence model induced by a MAMP CG is 
the same whether we use the definition of Z-open path above or the following simpler one. 
A path p in a MAMP CG G is said to be Z-open when 

• every triplex node in p is in Z u sanG{Z), and 

• every non-triplex node S in p is outside Z, unless A - B - G is a subpath of p and 
paG{B) \ Z t0. 

The motivation behind the three constraints in the definition of MAMP CGs is as follows. 
The constraint CI follows from the semidirected acyclicity constraint of AMP CGs and MVR 
CGs. For the constraints C2 and C3, note that typically every missing edge in a graphical 
model corresponds to a separation. However, this may not be true for graphs that do not 
satisfy the constraints C2 and C3. For instance, the graph G below does not contain any 
edge between B and D but B J.gD\Z for all Z ^ V \ {B,D}. Likewise, G does not contain 
any edge between A and E but AJ_gE\Z for all Z \ {A, E}. 

A^^^^I3^^^C^^^^~1J^ E 
I 

F 

Since the situation above is counterintuitive, we enforce the constraints C2 and C3. The- 
orem [2] below shows that every missing edge in a MAMP CG corresponds to a separation. 

Given a MAMP CG G, let G denote the AMP CG obtained by replacing every bidirected 
edge A ^ B in G with A L^^b B. Note that G and G represent the same separations 
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over V. Therefore, every MAMP CG can be seen as the result of marginahzing out some 
nodes in an AMP CG, hence the name. 

Note that AMP CGs and MVR CGs are special cases of MAMP CGs. However, MAMP 
GGs are a proper generalization of AMP CGs and MVR CGs, as there are independence 
models that can be induced by the former but not by the latter. An example follows (we 
postpone the proof that it cannot be induced by AMP CGs and MVR CGs until after Theorem 

SI). 




The theorem below shows that the independence models induced by MAMP CGs are not 
arbitrary in the probabilistic framework. 

Theorem 1. For any MAMP CG G, there exists a regular Gaussian probability distribution 
p that is faithful to G. 

Proof. It suffices to replace every bidire cted edge A B in G with A Lab B to create 



an AMP CG G, apply Theorem 6.1 by iLevitz et al.l ( 1200 ll ) to conclude that there exists a 



regular Gaussian probability distribution q that is faithful to G, and then let p be the marginal 
probability distribution of q over V. □ 

Corollary 1. Any independence model induced by a MAMP CG is a compositional graphoid. 

Proof. It follows from Theorem [T] by just noting that the set of independencies in a ny regular 



Gaus sian probability distribution satisfies the compositional graphoid properties (jStudenyl . 



20051 . Sections 2.2.2, 2.3.5 and 2.3.6). □ 



Finally, we show below that the independence model induced by a MAMP CG coincides 
with certain closure of certain separations. We define the local separation base of a MAMP 
CG G as the separations 

• A 1 B\paG{A) for all non-adjacent nodes A and B oi G st B i dedA), and 

• AlB\neG{A)upaG{AuneG(A)) for all non-adjacent nodes A and BofGstAe dcG^B) 
and B € deG^A), i.e. ucg{A) = ucg{B). 

We define the compositional graphoid closure of the local separation base of G, denoted 
as cl{G), as the set of separations that are in the base plus those that can be derived from 
it by applying the compositional graphoid properties. We denote the separations in cl{G) as 
Xl,i(^G)Y\Z. 

Theorem 2. For any MAMP CG G, zf X i^i^g)Y\Z then X1gY\Z. 

Proof. Since the independence model induced by G is a compositional graphoid by Corollary 
m it suffices to prove that the local separation base of G is a subset of the independence 
model induced by G. We prove this next. Let A and B be two non- adjacent nodes of G. 
Consider the following two cases. 

Case 1: B i deG{A). Then, every path between A and 5 in G falls within one of the 
following cases. 

Case 1.1: A - Vi <- V2 . . .Vn = B. Then, this path is not paG(A)-open. 

Case 1.2: A - Vi <>->■ V2 . . .Vn = B. Note that V2 V„ because, by assumption, A 
and B are non-adjacent in G. Note also that V2 i paG^A) due to the constraint 
CI. Then, V2 V3 must be in G for the path to be paG(^)-open. By repeating 
this reasoning, we can conclude that A - Vi <^ V2 ^ V^, ^ . . . Vn = B is m. G. 
However, this contradicts the assumption that B i dec (A). 
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Case 1.3: A = Vi - V2 - . . . -Vm ^ Kn+i . . .Vn = B. Note that Kn i paciA) due to 

the constraint CI. Then, this path is not paG(A)-open. 
Case 1.4: A = Vi -V2 - . . . -V^ Vm+i ■ ■ ■Vn = B. Note that Vm+i 4= Vn because, by 

assumption, B i dec^A). Note also that Vm+i i pacC^) due to the constraint CI. 

Then, V^+i Kn+2 must be in G for the path to be paG'(^)-open. By repeating 

this reasoning, we can conclude that A = Vi -V2 - . . . - Vm ^ Vm+i ... ^Vn = B 

is in G. However, this contradicts the assumption that B i dedA). 
Case 1.5: A = Vi-V2 - ■ ■ ■ - Vm-^ Vm+i . . .Vn = B. Note that Vm i paaiA) due to 

the constraint CI. Then, this path is not paG(^)-open. 
Case 1.6: A = Vi - V2 - . . . - Vn = B. This case contradicts the assumption that 

B i decXA). 

Case 2: A e dedB) and B e dedA), i.e. ucg{A) = ucg{B). Then, there is an undi- 
rected path p between A and B in G. Then, every path between A and B in G falls 
within one of the following cases. 

Case 2.1: A = Vi <- V2 . . .Vn = B. Then, this path is not (neaiA) u paaiA u 
neG(A)))-open. 

Case 2.2: A = Vi <^ V2 ■ ■ - Vn = B. Note that V2 + Vn due to p and the constraints 
CI and C2. Note also that V2 i nedA) upaG{AuneG{A)) due to the constraint 
CI. Then, V2 V3 must be in G for the path to be (neG{A) upaG(ALineG{A)))- 
open. By repeating this reasoning, we can conclude that A = Vi o-^ V2 ^ V3 ^ 
. . . ^ Vn = B is in G. However, this together with p violate the constraint CI. 

Case 2.3: A = Vi -V2 ^ V^ . . .Vn ^ B. Then, this path is not (ncGiA) u paGiA u 
neG'(y4)))-open. 

Case 2.4: A = Vi - V2 ^ T/3 ■ • • K = B. Note that V3 + Vn due to p and the 

constraints CI and C2. Note also that V3 i neG^A) u paG^Au ucg^A)) due to 
the constraints CI and C2. Then, V3 V4 must be in G for the path to be 
{neG^A) upaG(^uneG(^)))-open. By repeating this reasoning, we can conclude 
that A = Vi - V2 °^ V2, ^ . . . ^ Vn = B is in G. However, this together with p 
violate the constraint CI. 

Case 2.5: A = Vi - V2 - ^3 . . . K = ^ st spg{V2) = 0. Then, this path is not 
{neG{A) upaG^A u neG(^)))-open. 

Case 2.6: A = Vi -V2 - . . . - V„, - Kn+i ^ ...Vn ^ B st spGiVi) + for aU 
2<i<m. Note that Vi e neciVi) for all 3 < i < m + 1 by the constraint C3. Then, 
this path is not (nee (A) u paG{A u neG{A)))-open. 

Case 2.7: A = Vi - V2 - . . . - V^ - V^+i Vm+2 . . . K = 5 st spGiV) + for all 
2<i<m. Note that Kn+2 ^ Ki due to p and the constraints CI and C2. Note also 
that Vm+2 i neaiA) u paG(Au neaiA)) due to the constraints CI and C2. Then, 
Kn+2 Hiust be in G for the path to be (neaiA) u paaiAu neaiA)))-open. 

By repeating this reasoning, we can conclude that A = V1-V2 - ■ . ■ - Vm- Vm+i ^ 
Vjn+2 . . . ^ Vn = B is in G. However, this together with p violate the constraint 
CI. 

Case 2.8: A = Vi - V2 - . . . - V„, - Vm+i - Vm+2 . . . K = 5 st spaiVi) + for all 
2 <i <m and spaiVm+i) = 0- Note that Vi e neaiVi) for all 3 < i < m + 1 by the 
constraint C3. Then, this path is not ineaiA) u paaiA u neGiA)))-open. 

Case 2.9: A ^ Vi - V2 - . . . - Vn ^ B st spaiV) * for aU 2 < i < n - 1. Note that 
Vi € neaiVi) for all 3 < i < n by the constraint C3. However, this contradicts the 
assumption that A and B are non- adjacent in G. 

□ 



Theorem 3. For any MAMP CO G, if X 1gY\Z then XLd(G)V\Z. 
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Proof. We start by recalling some definitions from lAndersson et al.l ( 1200 ll . Section 2). Let F 
be an AMP CG and F' the result of removing all the directed edges from F. Given a set 
U ^V, let W = U u sanciU) and W = \JA<^w'^CGi_A). Let F[W] denote the graph whose 
nodes and edges are the union of the nodes and edges in F^r and F^,. F[W] is called an 
extended subgraph of F. An undirected graph is called complete if it has an edge between 
any pair of nodes. In an AMP CG, a triplex {{A,C},B) is an induced subgraph of the 
form A ^ B ^ C, A ^ B - C, or A - B ^ C. Augmenting a triplex {{A,C},B) means 
replacing it with the complete undirected graph over {A, B, C}. In an AMP CG G, a 2-biflag 
{{A,D},{B,C}) is a subgraph of the form A-^B-C-^D st A is not adjacent to C in G 
and B is not adjacent to D in G. Augmenting a 2-biflag {{A, D}, {B, G}) means replacing it 
with the complete undirected graph over {A, B, G, D}. Augmenting an AMP CG F, denoted 
as F"', means augmenting all its triplexes and 2-bifiags and converting the remaining directed 
edges into undi rected edges. Not e that X l fY\Z iff X is separated from Y given Z in 



F[X yjYijZY fiLevitz et al.l . 1200 ll . Theorem 4.1). 

Given an undirected graph F and a set [/ £ y, let F^ denote the undirected graph over 
U resulting from adding an edge A - B to Fu ii F has a path between A and B whose only 
nodes in U are A and B. F^ is sometimes called the marginal graph of F for U. 

Now, we start the proof per se. Let G denote the AMP CG obtained by replacing every 
bidirected edge A B m. G with A <- Lab ^ B. The node Lab is called latent. Let 
G = {G[X u y u Z\°-y^ . As mentioned before, G and G represent the same separations over 
V . Then, X LqY\Z iff X is separated from Y given Z in G. Note that the separations in G 
coincide with the graphoid closure of the separat ions ALj^yjG) \ A \ a(i^(A)|a(i^(^) for all 



A e V^(G), where ViG) denotes the nodes in G ( Bouckaertl . 19951 . Theorem 3.4). Therefore 



to prove that X 1ci(g)Y\Z, it suffices to prove that A1ci(g)V{G) \ A \ a(i^(A)|a(i^(A) for 
all A e V{G). Let Ki, . . . , Kn denote the connectivity components of Gy^^ st if A i? is in 
Gyf-Qy then A^ Ki and B e Kj with i < j. Consider the following cases. 

Case 1: Assume that A e Kn- Note that if D e adQ{A), then D € nedA) u padA u 
neG(A))LiA' uneG(A')LipaG{A' uneciA')) for some A' eV st G has a path A.. . <^ A' 
whose every node is in Kn- To see it, note that if D e ad^A) \ neG{A) \ paciA u 
neG{A)), then G[X u y u Z]" must have a path between A and D whose every 
node except A and D is latent. Then, G[X uY u Z] must have a path of the form 
A . . . L A' ^ D, A . . . L ^ A' - D, A . . . L ^ A' ^ D or A . . . L ^ A' - A" ^ D, where 
L is a latent node and every non- latent node between A and A' is in Kn- Note also 
that A' e deG{A) iff A e deciA'), because A, A' e Kn- Let B contain all those A' st 
A' € deG{A), i.e. ucg{A) = ucg{A'). Let C contain all those A' st A' i dedA). 

Consider any A' € hcg^A)- Note that dedA) = de^A')- Then, paG^A^nedA)) g 
ndcG^A) = ndeG(A')- Therefore, 

(1) Alci^G)ndeG{A) \ paG{A)\paG{A) and 

(2) A' 1 ci(G)^deG(A') \ paG(A')\paG{A') follow from the local separation base of G 
by repeated composition, which imply 

(3) Ala(G)ndeG(A) \paG(AuneG(A))\paG(AuneG(A)) and 

(4) A' lci(G)ndeG(A') \paG(AuneG(A))\paG{ALineG{A)) by weak union. Therefore, 

(5) AuneG{A) ]_ci(G)ndeG{A) \paG{AuneG{A))\paG{AuneG{A)) by repeated sym- 
metry and composition, which implies 

(6) Alci(G)ndeG{A) \paG{AuneG{A))\neGiA)upaGiAuneG{A)) by symmetry and 
weak union. Note that 

(7) A 1 ci(g)UCg(A) \ A \ neG{A)\neG{A) u paG^A u neG{A)) follows from the local 
separation base of G by repeated composition, which implies 

(8) Alci(G)[ndeG{A) \paG{AuneG{A))]u[ucG{A) \ A\neGiA)]\neG{A)upaGiAu 
neG{A)) by symmetry and composition on (6) and (7). 
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Consider any B ^ B \ nedA). By repeating the reasoning above, we can conclude 
(9) Bla(G)[ndeG(B)\paG(BuneG(B))]u[ucG(B)\B\neG(B)]\neG(B)upaG(Bu 

neciB)). 

Recall that deciA) = dcQ^B) and ucg{A) = ucg{B). Then, paG(A u Bu ncG^A u 
B)) c ndeciA) = ndeG(B), and nedA u i5) c ucg(A) = ucg(B). Therefore, 

(10) Aici(^G)['ndeG{A)\paG{AuBuneG{ALiB))]Li[ucG{A)\A\B\neG{AuB)]\neG{Au 
B)LipaG(AuBuneG(AuB)) and 

(11) Bl,i^G)[ndeGiB)^paGiAuBuneGiAuB))]u[ucGiB)^A^B^neGiAuB)]\neGiAu 
B) ^pttG^A u ,B u ncG^A u B)) by weak union and decomposition on (8) and (9). 
Therefore, 

(12) A vj B 1 ci{G)[ndeG{A) \paG{A u i5 u neG{A\jB))] u [ucg{A) \ A\B\ neG{A u 
B)]\neG{AuB)upaG(AuBuneG{AuB)) by repeated symmetry and composition, 
which implies 

(13) Ai a(G) [ndcGiA) \ paG (AuBu neG(A ^B))]u [ucg ( A) \A\B\ neG(A uB)]\Bu 
neG(A uB)u paG(A uBu neG(A uB)) by symmetry and weak union. 

Finally, note that C u neG{C) upaG{C u neG{C)) £ ndeG{A). Therefore, 

(14) Alci^G)[ndeG{A) \C \neGiC) \paG{ALiBuCL>neG{AuBuC))]u[ucG{A) \ A\ 
B\neGlAuB)]\BuCuneG(AuBuC)upaG(AuBuCuneG(AuBuC)) by weak 
union on (13), which implies 

(15) Alci(G)V(G) \ A \ adQ(A)\adQ{A) by decomposition. 

Case 2: Assume that A e K„_i. Let H = {G[X uYu Z \ K^YY . Note that H is a 
subgraph of G and, thus, a,d-jj-(A) £ a,d-^(A). Let B = ad-(j(A) n Kn and C = Kn \ B. 
Note that B or C may be empty. Note also that paG^B) c ad-Q^A) u A. To see it, 
note that this is evident for any D e chG{A) u neG{chG{A)). On the other hand, if 
D & B \ chG(A) \ neG(chG(A)) then G[X uY u Z]" must have a path between A and 
D whose every node except A and D is latent. Then, G[X uY u Z] must have a path 
of the form A . . . L ^ D or A . . . L ^ A' - D, where L is a latent node and A' is a 
non-latent node. Then, clearly paG^D) £ ad-Q^A) u A. 
Consider any B & B. Then, 

(1) B lci(G)V(H) \ paG(B)\paG(B) follows from the local separation base of G by 
repeated composition, which implies 

(2) B ]_ ci(^G)^ {H) \ paG{B)\paG{B) by weak union, which implies 

(3) Blci{G)V{H) \ paG{B)\paG{B) by repeated symmetry and composition, which 
implies 

(4) B lci(G)V(H) \ A \ adQ{A)\adQ(A) \ Bu Ahy weak union and the fact, shown 
above, that paG{B) 9 ao?^(A) u A. 

Note that H is a proper marginal augmented extended subgraph. Then, 

(5) Alci(^G)y{H) \ A \ ad-jj{A)\ad-jj{A) follows from repeating Case 1 for H, which 
implies 

(6) Alci(G)V(H) \ A \ adQ(A)\adQ(A) \B by weak union, which imphes 

(7) A u B lci(G)V(H) \ A \ adQ(A)\adQ(A) \ B by symmetry and contraction on (4) 
and (6), which implies 

(8) Alci(G)V{H) \ A \ adQ{A)\adQ{A) by symmetry and weak union. 
Finally, consider any C e C. Then, 

(9) Clci(G)V(G) \ C \ a%(C)|a%(C) by Case 1, which implies 

(10) G lci{G)A\V{G) \ A \ G by weak union, which implies 

(11) C lci(G)A\V{G) \ ^4 \ C by repeated symmetry and intersection, which implies 

(12) C1ci(g)A\V(H) \AuB, which implies 

(13) C u [V{H) \ A \ adQ{A)] 1 ci(G)A\adQ{A) by symmetry and contraction on (8) 
and (12), which implies 
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(14) Al^i(^G)V{G) \A\ ad^{A)\ad^{A) by symmetry. 
Case 3: Assume that A e Ki with 1 < i < n - 2. Then, repeat Case 2 replacing Kn-i 
with Ki, and letting B = adQ{A) n Ki+i and C = Ki+i u . . . u Kn ^ B. 

□ 

4. Markov Equivalence 

We say that two MAMP CGs are Markov equivalent if they induce the same independence 
model. In a MAMP CG, a triplex {{A, C},B) is an induced subgraph of the form A <^ B ^C, 
A^B-C,otA-B-^C. We say that two MAMP CGs are triplex equivalent if they have 
the same adjacencies and the same triplexes. 

Theorem 4. Two MAMP CGs are Markov equivalent iff they are triplex equivalent. 

Proof. We first prove the "only if part. Let Gi and G2 be two Markov equivalent MAMP 
CGs. First, assume that two nodes A and C are adjacent in G2 but not in Gi. If A and G are 
in the same undirected connectivity component of Gi, then A lG\neGj^{A)upaGi {Auneci (A)) 
holds for Gi by Theorem [2] but it does not hold for G2, which is a contradiction. On the other 
hand, if A and G are in different undirected connectivity components of Gi, then A i deci (C) 
or C ^ dec^iA). Assume without loss of generality that A i dec^i^G). Then, A ]_G\paG^{G) 
holds for Gi by Theorem |2]but it does not hold for G21 which is a contradiction. Consequently, 
Gi and G2 must have the same adjacencies. 

Finally, assume that Gi and G2 have the same adjacencies but Gi has a triplex {{A, G},B) 
that G2 does not have. If A and G are in the same undirected connectivity component of 
Gi, then AlG\neGi(A) u pac^^iA u neciiA)) holds for Gi by Theorem |2j Note also that 
B i nec^^A) yjpaG^{AyjneGi{A)) because, otherwise, Gi would not satisfy the constraint CI 
or C2. Then, A ]_G\neGi{A)upaGi (Aurieci {A)) does not hold for G2, which is a contradiction. 
On the other hand, if A and G are in different undirected connectivity components of Gi, 
then A i deGi(G) or G i deGi(A). Assume without loss of generality that A i deGi(G). Then, 
AlG\paGi(G) holds for Gi by Theorem El Note also that B i paGi(G) because, otherwise, 
Gi would not have the triplex {{A, G}, B). Then, AlG\paGj^{G) does not hold for G2, which 
is a contradiction. Consequently, Gi and G2 must be triplex equivalent. 

We now prove the "if part. Let Gi and G2 be two triplex equivalent MAMP CGs. We just 
prove that all the non-separations in Gi are also in G2. The opposite result can be proven in 
the same manner by just exchanging the roles of Gi and G2 in the proof. Specifically, assume 
that a 1 (3\Z does not hold for Gi. We prove that a 1 (3\Z does not hold for G2 either. We 
divide the proof in three parts. 

Part 1 

We say that a path has a triplex ({^4, G},B) if it has a subpath of the form A 0^ B ^ G, 
Ao-^B-G, or A-B-^G. Let pi be any path between a and /3 in Gi that is Z-open st 
(i) no subpath of pi between a and in Gi is Z-open, (ii) every triplex node in pi is in Z, 
and (iii) pi has no non-triplex node in Z. Let p2 be the path in G2 that consists of the same 
nodes as pi. Then, p2 is Z-open. To see it, assume the contrary. Then, one of the following 
cases must occur. 

Case 1: p2 does not have a triplex {{A, G},B) and B e Z. Then, pi must have a triplex 
({A,G}, B) because it is Z-open. Then, A and G must be adjacent in Gi and G2 
because these are triplex equivalent. Let ^1 be the path obtained from pi by replacing 
the triplex {{A, G},B) with the edge between A and G in Gi. Note that gi cannot be 
Z-open because, otherwise, it would contradict the condition (i). Then, gi is not Z- 
open because A or G do not meet the requirements. Assume without loss of generality 
that G does not meet the requirements. Then, one of the following cases must occur. 
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Case 1.1: does not have a triplex {{A,D},C) and C e Z. Then, one of the 
following subgraphs must occur in Gia 



However, the first three subgraphs imply that pi is not Z-open, which is a contra- 
diction. The fourth subgraph implies that Qi is Z-open, which is a contradiction. 
Case 1.2: qi has a triplex ({A,D},C) and C i Z u sanG-^{Z). Note that C cannot 
be a triplex node in pi because, otherwise, pi would not be Z-open. Then, one 
of the following subgraphs must occur in Gi. 

A^^^B^C ^ D aS^B^C — D A^^^IT^C — D A^^B^ C — D 



However, the first and second subgraphs imply that C e Z u sanc^ (Z) because 
B e Z, which is a contradiction. The third subgraph implies that i? - D is in 
Gi by the constraint C3 and, thus, that the path obtained from pi by replacing 
B - C - D with B - D is Z-open, which contradicts the condition (i). For the 
fourth subgraph, assume that A and D are adjacent in Gi. Then, one of the 
following subgraphs must occur in Gi. 



D^E 



E 



However, the first subgraph implies that the path obtained from pi by replacing 
A -> B - C - D with A D is Z-open, because D i Z since pi is Z-open. 
This contradicts the condition (i). The second subgraph implies that the path 
obtained from pi by replacing A ^ B - C - D with A-^ D is Z-open, because D e 
Z \J sanc^^Z) since pi is Z-open. This contradicts the condition (i). Therefore, 
only the third subgraph is possible. Thus, by repeatedly applying the previous 
reasoning, we can conclude without loss of generality that the following subgraph 
must occur in Gi, with n > 4, Vi = A, V2 = B , V3 = G , = D and where Vi and 
Vn are not adjacent in Gi. Note that the subgraph below covers the case where 
A and D are not adjacent in the original subgraph by simply taking n = 4. 




V,^V2-Vs-V, Vn-l-Vn 



Since Vi and Vn are not adjacent in Gi, Gi has a triplex {{Vi,Vn},Vn-i) and, 
thus, so does G2 because Gi and G2 are triplex equivalent. Then, one of the 
following subgraphs must occur in G2. 

Vi Vn-l «3 Vn Vi ^^^"^^ Vn-1 ^ Vn Vi "^""^^ Vn^l " K 



^If Qi does not have a triplex ({A, D}, C), then A<-C, C^D or A-C-D must be in Gi . Moreover, 
recall that B is a triplex node in pi. Then, A^B^C\A^B^C,A^B-C,A^B^C,A^B^C, 
A ^ B - C, A-B^CotA-B^C must be in Gi . However, if <- C is in Gi then the only legal options 
are those that contain the edge B ^ G. On the other hand, if A - C - 1? is in Gi then the only legal options 
are A ^ i? ^ G and A^ B ^ G. 
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Note that Vi,. . . ,Vn must be a path in G2, because Gi and G2 are triplex equiv- 
alent. Note also that this path cannot have any triplex in 6*2- To see it, recall 
that we assumed that p2 does not have a triplex ({A, C},B). Recall that Vi = A, 
V2 - B, V3 - C. Moreover, if the path Vi, . . . ,Vn has a triplex ({V^, Vi+2}, ^+1) in 
G2 with 2 < i < n - 2, then Vi and Vi+2 must be adjacent in Gi and G2, because 
such a triplex does not exist in Gi , which is triplex equivalent to G2 ■ Specifically, 
Vi - Vi+2 must be in Gi because, as seen above, Vi - Vi+i - Vi+2 is in Gi. Then, the 
path obtained from pi by replacing Vi - Vi+i - Vi+2 with Vi - Vi+2 is Z-open, which 
contradicts the condition (i). However, if the path Vi,...,V^ has no triplex in 
6*2, then every edge in the path must be directed as in the case of the first and 
second subgraphs above, whereas every edge in the path must be undirected or 
directed as in the third subgraph above. Either case contradicts the constraint 
CI or C2. 

Case 2: Case 1 does not apply. Then, p2 has a triplex {{A, C},B) and B i ZusanciiZ). 
Then, pi cannot have a triplex {{A,G},B). Then, A and C must be adjacent in Gi 
and G2 because these are triplex equivalent. Let qi be the path obtained from pi by 
replacing the triplex {{A, C},B) with the edge between A and C in Gi. Note that ^1 
cannot be Z-open because, otherwise, it would contradict the condition (i). Then, ^1 
is not Z-open because A or C do not meet the requirements. Assume without loss of 
generality that C does not meet the requirements. Then, one of the following cases 
must occur. 

Case 2.1: gi has a triplex {{A,D},C) and C i Z u sanciiZ). Then, one of the 
following subgraphs must occur in GiU 

A^^^B^C^D A^^^B^C — D A^^li^ G ^ D 
A^^^ls^C — D A^^^^IT^C^D A^^^^IT^G^D 



However, this implies that C is a triplex node in pi, which is a contradiction 
because pi is Z-open but G i Z u sancj^(Z). 
Case 2.2: qi does not have a triplex ({A, D},G) and G e Z. Then, A^ G, G D 
or A- G-D. 

Case 2.2.1: If C ^ D or A-G-D, then one of the following subgraphs must 
occur in Gi. 

Ao^^H^G ^ D A^^^B^G — D A^li^ G — D 



However, the first and second subgraphs imply that pi is not Z-open, which 
is a contradiction. The third subgraph implies that qi is Z-open, which is 
a contradiction. 

Case 2.2.2: If A <- C then ({A, D},C) is not a triplex in qi. However, note 
that pi must have a triplex ({i?,D},C), because pi is Z-open and C e Z. 
Then, one of the following subgraphs must occur in Gi. 

a'^^Ib^g^d a^^Ib^g — D A^^^^B^G^D 



% ei has a triplex {{A,D},C), then A ^ C « £», A^C-D,A^C^D,A^C-D or A - C « D must 
be in Gi. Moreover, recah that B is not a triplex node in pi. Then, A-^B-(-C,A-<-B^C,A^B^C, 
A<-B-C,A^B-^C,A^B^C,A-B^CorA-B-C must be in d. However, if A ^ C is in d 
then the only legal options are those that contain the edge B ^ C. On the other hand, if A C is in Gi 
then the only legal option is A ^ B ^ C. Finally, if A - C is in Gi then the only legal options are A ^ B ^ C 
and A- B-C. 
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Assume that A and D are adjacent in Gi. Then, A D must be in Gi. 
Moreover, D ^ Z because, otherwise, we can remove B and C from pi and 
get a Z-open path between A and B in Gi that is shorter than pi, which 
contradicts the condition (i). Then, D must be a triplex node in pi. Then, 
one of the foUowing subgraphs must occur in Gi. 



A^B^C^D^E A^B^C^D—E A^B^C—D^E 



A^B G^D^E A^B C^D E 



Thus, by repeatedly applying the previous reasoning, we can conclude with- 
out loss of generality that the following subgraph must occur in Gi, with 
n > i, Vi - A, V2 - B , V3 = G , V4 - D and where Vi and Ki are not adjacent 
in Gi. Note that the subgraph below covers the case where A and D are 
not adjacent in the original subgraph by simply taking n = 4. 



Note that is a triplex node in pi for all 3 < i < rz - 1. Then, Vi^Z for all 
3<i<?7,-l by the condition (ii) because p\ is Z-opcn. Then, Vi must be 
a triplex node in p2 for all 3 < i < n - 1 because, otherwise, Case 1 would 
apply instead of Case 2. Recall that V2 = -B is also a triplex node in p^. 
Note that G\ does not have a triplex ({Vi, and, thus, G2 does 

not have it either because these are triplex equivalent. Then, one of the 
following subgraphs must occur in G^- 

Vi'^^Vn-l^Vn Fl^^K-l^K ^l^^K-l-K 



However, the first subgraph implies that V^_i is not a triplex node in p2, 
which is a contradiction. The second subgraph implies that G2 has a cycle 
that violates the constraint CI. To see it, recall that is a triplex node in 
P2 for all 2 < i < n-1 and, thus, Vi ^ Vi+i is not in G2 for all 1 < i < n-2. The 
third subgraph implies that Vn-2 Ki-i is not in G2 because, otherwise, 
Vi and Vn would be adjacent by the constraint C3. Therefore, Vn-2 Ki-i 
must be in G2 because V^_i is a triplex node in p2. However, this implies 
that Vn-2 is not a triplex node in p2, which is a contradiction. 

Part 2 

Let pi be any of the shortest Z-open paths between a and /3 in Gi st all its triplex nodes 
are in Z. Let p2 be the path in G2 that consists of the same nodes as pi. We prove below 
that p2 is Z^-open. We prove this result by induction on the number of non-triplex nodes of 

pi that arc in Z. If this number is zero, then Part 1 proves the result. Assume as induction 
hypothesis that the result holds when the number is smaller than m. We now prove it for m. 

Let pi''^ denote the subpath of pi between the nodes A and B. Let C be any of the non- 
triplex nodes of pi that are in Z. Note that there must exist some node D ^pac^iG) \ Z for 
pi to be Z-open. If D is in pi, then pf-^ u D ^ Gu p^''^ or p^'*^ uG ^ Du pf ''^ is a Z-open 
path between a and f3 in Gi that has fewer than m non-triplex nodes in Z. Then, the result 
holds by the induction hypothesis. On the other hand, if D is not in pi, then p"''^ u C D 
and D ^ G ^ p^ are two paths. Moreover, they are Z-open in Gi and they have fewer than 
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m non-triplex nodes in Z. Then, by the induction hypothesis, there are two Z-opcn paths 
p^-^ and P2 '^ in G2 st the former ends with the nodes C and D and the latter starts with 
these two nodes. Now, consider the following cases. 



Case 1: p^'^ ends with A-C^D. Then, pf ^ starts with D^C-BorD^C^B. 

Then, p2 = p'^'" u p^'^ is Z-open a path in either case. 
Case 2: p"^^ ends with A-C ^ B. Then, p^'^ starts with D^C-B or D^C^B. 

Then, p2 = P2''" u p*^'^ is Z-open a path in either case. 
Case 3: p^-^ ends with A<^C-D. Then, ^ starts with D -C ^ B, or D -C - B st 

there is some node E e pacaCC*) ^ Z. Then, p2 - '-' P2 Z-open a path in either 

case. 

Case 4: p^^-° ends with A^^C ^ D. Then, pf ^ starts with D <^ C-B or D <^ C ^ B. 

Then, p2 = p"'*" u p^''^ is Z-open a path in either case. 
Case 5: pg'^ ends with A - C - D st there is some node E e paG2{C) \ Z. Then, p^'^ 

starts with D - C <r-o B, or D - C - B st there is some node F e paca (C*) \ Then, 

P2 = P2'*^ '-' is a Z-open path in either case. 



Part 3 

Assume that Part 2 does not apply. Then, every Z-open path between a and (3 in Gi has 
some triplex node Bi that is outside Z because, otherwise. Part 2 would apply. Note that 
for the path to be Z-open, Gi must have a subgraph Bi ^ ... ^ Bn st Bi, ... , Bn-i i Z but 
Bn e Z. Let us convert every Z-open path between a and (5 in Gi into a route by replacing 
each of its triplex nodes Bi that are outside Z with the corresponding route Bi ->■... ^ Bn 
. . . <^ Bi. Let Qi be any of the shortest routes so-constructed. Let pi be the path from which 
^1 was constructed. Note that pi cannot be Z-open because, otherwise. Part 2 would apply. 
Let W denote the set of all the triplex nodes in pi that are outside Z. Then, pi is one of the 
shortest (ZuVF)-open paths between a and /S in Gi st all its triplex nodes are in ZuW. To 
see it, assume to the contrary that p[ is a (Z u l^)-open path between a and 13 in Gi that is 
shorter than pi and st all the triplex nodes in p[ are in Z u W. Let g[ be the route resulting 
from replacing every node Bi of p'l that is in W with the route Si i?„ i?i 

that was added to pi to construct ^1. Clearly, g[ is shorter than ^1, which is a contradiction. 
Let g2 and p2 be the route and the path in G2 that consist of the same nodes as gi and pi. 
Note that p2 is {Z u W)-open by Part 2. 

Consider any of the routes i?i i?„ i?i that were added to pi to construct 

^1. This implies that pi has a triplex {{A, C}, Bi). Assume that Bi B2 is in Gi but B1-B2 
or Bi <-o B2 is in G2. Note that A °^ Bi or Bi -(r-o C is in G2 because, as noted above, p2 is 
(Z u Vr)-open. Assume without loss of generality that A Si is in G2. Then, A - Bi B2 
or A <^ Bi ^ B2 is in Gi whereas A Bi - B2 or A <^ Bi ^ B2 is in G2. Therefore, A and 
B2 must be adjacent in Gi and G2 because these are triplex equivalent. This implies that 
A S2 is in Gi. Moreover, A & Z because, otherwise, we can construct a route that is shorter 
than ^1 by simply removing Bi from ^1, which is a contradiction. This implies that A Bi 
is in G2 because, otherwise, p2 would not be {Z u VF)-open. This implies that A ^ Bi - B2 
or A Bi ^ B2 is in G2, which implies that A - i?2 or ^4 -^-o i?2 is in G2. The situation is 
depicted in the following subgraphs. 
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Now, let A' be the node that precedes ^4 in pi. Note that A' <- A cannot be in pi or p2 
because, otherwise, these would not be (ZuVF)-open since Ae Z. Then, A' - A or A' <^ A is 
in Gi and G2. Then, A' - A ^ B2 or A' A ^ B2 is in d whereas A' - A ^ B2, A' - A- B2, 
A' A B2 or A' A - B2 is in G2. These four subgraphs of G2 imply that A' and 
B2 must be adjacent in Gi and G2- The second subgraph due to the constraint C3 because 
A ^ Bi is in G2, and the other three subgraphs because Gi and G2 are triplex equivalent. 
By repeating the reasoning in the paragraph above, we can conclude that A' B2 is in Gi, 
which implies that A' € Z, which implies that A' - A or A' ^ A is in G2, which implies that 
A' - B2 or A! ^ B2 is in G2. 

By repeating the reasoning in the paragraph abovejl we can conclude that a ^ B2 is in Gi 
and, thus, we can construct a route that is shorter than gi by simply removing some nodes 
from Qi, which is a contradiction. Consequently, Bi B2 must be in 6*2. 

Finally, assume that Bi ^ B2 B3 is in Gi but Bi B2 - B3 or Bi B2 ^ B3 is in 
6*2. Then, Bi and B3 must be adjacent in Gi and G2 because these are triplex equivalent. 
This implies that Bi S3 is in Gi, which implies that we can construct a route that is 
shorter than ^1 by simply removing B2 from ^1, which is a contradiction. By repeating this 
reasoning, we can conclude that i?i . . . i?„ is in (^2 and, thus, that p2 is Z-open. 

□ 

We mentioned in the previous section that MAMP CGs are a proper generalization of AMP 
CGs and MVR CGs, as there are independence models that can be induced by the former 
but not by the latter. Moreover, we gave the following example and postponed the proof that 
it cannot be induced by AMP CGs and MVR CGs. 

A^B — G 

I 

D^E 



With the help of Theorem HI we can now give the proof. Assume to the contrary that the 
independence model induced by the MAMP CG G above can be induced by an AMP CG H. 
Note that if is a MAMP CG too. Then, G and H must have the same triplexes by Theorem 
H Then, H must have triplexes {{A,D},B) and ({A,G},B) but no triplex ({G,D},B). So, 
C-B-D must be in H. Moreover, H must have a triplex {{B, E},G). So, G E must be in 
H. However, this implies that H does not have a triplex ({C, D},E), which is a contradiction 
because G has such a triplex. To see that no MVR CG can induce the independence model 

^Let A" be the node that precedes A' in pi. For this repeated reasoning to be correct, it is important to 
realize that ii A' - A is in G2, then A" ^ A' must be in G2, because A' e Z and p2 is {Z u W)-opcn. 



14 



induced by G, simply note that no MVR CG can have triplexes ({^4, D},B) and {{A, C},B) 
but no triplex {{C,D},B). 

We prove below that every triplex equivalence class of MAMP CGs has a distinguished 
member. We say that two nodes form a directed node pair if there is a directed edge between 
them. 

Lemma 1. For every triplex equivalence class of MAMP CGs, there is a unique maximal 
(wrt to set inclusion) set of directed node pairs st some CG in the class has exactly those 
directed node pairs. 

Proof. Assume to the contrary that there are two such sets of directed node pairs. Let the 
MAMP CG G contain exactly the directed node pairs in one of the sets, and let the MAMP 
CG H contain exactly the directed node pairs in the other set. For every A ^ B m G si 
A - B oi A ^ B IS in H , replace the edge between A and B in H with A ^ B and call the 
resulting graph F. We prove below that F is a MAMP CG that is triplex equivalent to G 
and thus to i/, which is a contradiction since F has a proper superset of the directed node 
pairs in H. 

First, note that F cannot violate the constraints C2 and C3. Assume to the contrary that 
F violates the constraint CI due to a cycle p. Note that none of the directed edges in p can be 
in H because, otherwise, H would violate the constraint CI, since H has the same adjacencies 
as F but a subset of the directed edges in F. Then, all the directed edges in p must be in G. 
However, this implies the contradictory conclusion that G violates the constraint CI, since 
G has the same adjacencies as F but a subset of the directed edges in F. 

Second, assume to the contrary that G (and, thus, H) has a triplex {{A, C},B) that F has 
not. Then, {A, B} or {B, C} must an directed node pair in G because, otherwise, F would 
have a triplex ({A,C},B) since F would have the same induced graph over {A,B,C} as H. 
Specifically, A ^ B or B ^ C must be in G because, otherwise, G would not have a triplex 
{{A,C},B). Moreover, neither A B nor B ^ C can be H because, otherwise, H would 
not have a triplex {{A, C},B). Therefore, if A B or B <^ G is in G and neither A<- B nor 
B ^ C is in H, then A ^ B or B <^ C must be in F. However, this implies that B ^ C or 
A B must be in F because, otherwise, F would have a triplex {{A, C},B) which would be 
a contradiction. However, this is a contradiction since neither B ^ C nor A <^ B can be in 
G or H because, otherwise, neither G nor H would have a triplex {{A, C},B). 

Finally, assume to the contrary that F has a triplex ({A,C},B) that G has not (and, thus, 
nor does H). Then, A - B - C must be in H because, otherwise, A B or B ^ C would be 
in H and, thus, F would not have a triplex ({A,C},B). However, this implies that A ^ B 
or B <- C is in G because, otherwise, F would not have a triplex {{A,C},B). However, this 
implies that B ^ G or A <^ B is in G because, otherwise, G would have a triplex ({A, C}, B). 
Therefore, A^B^C or A^B^C is in G and, thus, A->-B^C or A^B^C must be 
in F since A - B - G is in H. However, this contradicts the assumption that F has a triplex 
i{A,C},B). □ 

A MAMP CG is a directed CG (DCG) if it has exactly the maximal set of directed node 
pairs corresponding to its triplex equivalence class. Note that there may be several DCGs in 
the class. For instance, the triplex equivalence class that contains the MAMP CG A^ B has 
two DCGs (i.e. A^ B and A^ B). 

Lemma 2. For every triplex equivalence class of DCGs, there is a unique maximal (wrt to 
set inclusion) set of bidirected edges st some DCG in the class has exactly those bidirected 
edges. 

Proof. Assume to the contrary that there are two such sets of bidirected edges. Let the DCG 
G contain exactly the bidirected edges in one of the sets, and let the DCG H contain exactly 
the bidirected edges in the other set. For every A B in G st A - B is in H , replace 
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Table 1. Subfamilies of MAMP CGs. 



MAMP CGs 



AMP CGs 



RCGs 




MVR CGs 



Bayesian networks 



Covariance graphs 



A - B with A B in H and call the resulting graph F. We prove below that F is a DCG 
that is triplex equivalent to G, which is a contradiction since F has a proper superset of the 
bidirected edges in G. 

First, note that F cannot violate the constraint CI. Assume to the contrary that F violates 
the constraint C2 due to a cycle p. Note that all the undirected edges in p are in H. In fact, 
they must also be in G, because G and H have the same directed node pairs and bidirected 
edges. Moreover, the bidirected edge in p must be in G or H. However, this is a contradiction. 
Now, assume to the contrary that F violates the constraint C3 because A-B-G and B ^ D 
are in F but A and C are not adjacent in F (note that if A and C were adjacent in F, then 
they would not violate the constraint C3 or they would violate the constraint CI or C2, which 
is impossible as we have just shown). Note that A - B - C must be in H. In fact, A-B-G 
must also be in G, because G and H have the same directed node pairs and bidirected edges. 
Moreover, B ^ D must be in G or H. However, this implies that A and G are adjacent in 
G or if by the constraint C3, which implies that A and G are adjacent in G and H because 
they are triplex equivalent and thus also in F, which is a contradiction. Consequently, F is 
a MAMP CG, which implies that F is a DCG because it has the same directed edges as G 
and H. 

Second, note that all the triplexes in G are in F too. 

Finally, assume to the contrary that F has a triplex {{A, G},B) that G has not (and, thus, 
nor does H). Then, A - B - C must be in H because, otherwise, A ^ B or B G would 
be in H and thus F would not have a triplex {{A,G},B). However, this implies that F has 
the same induced graph over {A, B, G} as G, which contradicts the assumption that F has a 
triplex ({AG}, E). □ 

A DCG is a bidirected DCG (BDCG) if it has exactly the maximal set of bidirected edges 
corresponding to its triplex equivalence class. Note that there may be several BDCGs in the 
class. For instance, the triplex equivalence class that contains the MAMP CG A B has two 
BDCGs (i.e. A ^ B and A B). Note however that all the BDCGs in a triplex equivalence 
class have the same triplex edges, i.e. the edges in a triplex. 



5. Discussion 



In this paper we have introduced MAMP CGs, a new family of graphical models that 
generalize AMP CGs and MVR CGs. We have described global and local Markov properties 
for them and proved their equivalence for compositional graphoids. We have also charac- 
terized when two MAMP CGs are Markov equivalent. We conjecture that every Markov 
equivalence class of MAMP CGs has a distinguished member. We are currently working 
on this question. It is worth mentioning that such a result has been proven for AMP CGs 



(IRoverato and Studenvl . l2006l ). We are also working on a constraint based algorithm for 



learning a MAMP CG a given probability distribution is faithful to. The idea is to combine 
the le arning algorithms that we have recently proposed for AMP CGs (Pena, 20121 ) and MVR 
CGs (ISonntag and Peiial . [2oTi ) . 
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We believe that the most natural way to generalize AMP CGs and MVR CGs is by allowing 
undirected, directed and bidirected edges. However, we are not the first to introduce a family 
of graphical models whose members may contain these three types of edges. In the rest of this 
section, we review some works t hat have do ne it before us, and explain how our work differs 
from them. ICox and WermuthI (119931 . 119961 ) introduced regression CGs (RCGs) to generalize 
MVR CGs by allowing them to have also undirected edges. The separation criterion for RCGs 
is identical to that of MVR CGs. Then, there are independence models that can be induced 
by MAMP CGs but that cannot be induced by RCGs, because RCGs generalize MVR CGs 
but not AMP CGs. For instance, the independence model induced by the AMP CG G 



B — C — D 



cannot be induced by any RCG. To see it, assume to the contrary that it can be induced by 
a RCG H. Note that i^T is a MAMP CG too. Then, G and H must have the same triplexes 
by Theorem m Then, H must have triplexes ({A, and ({A, D},C) but no triplex 

({S, D}, C). So, B^G^D, B^G-D, B^C^D or B-C^D must be in H. 
However, this implies that H does not have the triplex {{A, B},G) or {{A, D},G), which is a 
contradiction. It is worth mentioning that, although RCGs can have undirected edges, they 
cannot have a subgraph of the form A <^ B -G. Therefore, RCGs are a subfamily of MAMP 
CGs. Table [1] depicts this and other subfamilies of MAMP CGs. 

Another family of graphical models whose members r nay contain undirected, directe d and 
bidirected edges is maximal ancestral graphs (MAGs) ( Richardson and Spirted . 2002 ). The 
separation criterion for MAGs is identical to that of MVR CGs. Therefore, the example above 
also serves to illustrate that MAGs generali ze MVR CGs but not AMP CGs, as MAMP CGs 
do. See also ( Richardson and Spirted . 2002[ p. 1025). It is worth mentioning that, although 
MAGs can have undirected edges, they must comply with certain topological constraints. As 
of today, we do not know if MAGs are a subfamily of MAMP CGs. We are currently working 
on this question. However, even when an independence model can be induced by a MAG, a 
MAMP CG may be preferred, as the following example illustrates. Consider the MAMP CG 
A<-B-^G-D-E and call it G. Consider the independence model resulting from G by 
marginalizing out B. This model can be induced by the MAMP CG A^G-D-E, or hy 
the MAGs A'^^G-^D'^E or A'>^G^D-^E. However, the MAGs suggest the existence 
of the causal relationship G ^ D or D ^ E, although neither exists in G. On the other hand, 
the MAMP CG does not suggest any causal relationship and, thus, it is preferred. 

Note that the result of marginalizing out B in the MAMP CG in the previous example 
results in another MAMP CG. Unfortunately, this property does not hold in general, i.e. 
MAMP CGs are not closed wrt marginalization. On the other hand, MAGs are closed wrt 
marginalization and conditioning. This is a desirable property, because one can compare the 
result of different manipulations on the system being modeled. We plan to study how to 
extend or modify MAMP CGs so that they become closed wrt marginalization and condi- 
tioning. 

Finally, three other families of graphical models whose members may contain undirected, 
directed and bidirected edges are summary graphs after replacing the dashed un directed edges 
with bidirected edges (Cox and Werniuth . 1996 ). MC graphs ('Kostei. 2002), and loopless 
mixed graphs ( Sadeghi and Lauritzenl . I2OI2I ). As shown in (Sadeg hi and Lauritzenl . 20121 
Sections 4.2 and 4.3), every independence model that can be induced by summary graphs 
and MC graphs can also be induced by loopless mixed graphs. The separation criterion for 
loopless mixed graphs is identical to that of MVR CGs. Therefore, the example above also 
serves to illustrate that loopless mixed graphs generalize MVR CGs but not AMP CGs, as 
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MAMP CGs do. See also ( ISadeghi and Lauritzenl . I2012L Section 4.1). Moreover, summary 
graphs and MC graphs have a rather cou nterintuitive and undes i rable feature: Not every 
missing edge corresponds to a separation (IRichardson and Spirte 2. l2002l . p. 1023). MAMP 
CGs, on the other hand, do not have this disadvantage (recall Theorem H]). 

In summary, MAMP CGs are the only graphical models we are aware of that general- 
ize both AMP CGs an d MVR CGs. A by-product of this generalization is the following. 
Andersson et al. ( 2001 . Section 5) and Kang and TianI ( 2009 . Section 2) show that any regu- 
lar Gaussian probability distribution that is Markovian wrt an AMP CG or MVR CG G can 
be expressed as a system of linear equations with correlated errors whose structure depends 
on G. We extend below this result to MAMP CGs. 

Let p denote any regular Gaussian distributions that is Markovian wrt a MAMP CG 
G. Assume without loss of generality that p has mean 0. Let Ki denote any connectivity 
component of G. Let ^ and ^^^^^ ^ denote submatrices of the precision matrix fi* of 
p(Ki,paG{Ki)). Then, as shown by fBishop ( 20061 Section 2.3.1), 



where 
and 



K,\paG{Ki)^Af{l3'paG{K,),A') 



Then, p can be expressed as a system of linear equations with normally distributed errors 
whose structure depends on G as follows: 

K,^^'paG{Ki) + e' 

where 

-AA(0,A0. 

Note that for all A,BeKi st ucg{A) = ucg(B) and A- B is not in G, AlGB\paG{Ki)uKi \ 
A \ B and thus (A^^^^^^ ucc(A))a^b ~ ^ t lLauritzenl . 119961 . Proposition 5.2). Note also that for 
all A,B € Ki st ucg(A) ^ ucg(B) and A ^ B is not in G, AlGBlpadKi) and thus A^ ^ = 0. 
Finally, note also that for all A e and B e paciKi) si A B is not in G, A ]_ GB\paG{A) 
and thus - 0. Let Pa contain the nonzero elements of the vector (/3*)a,.- Then, p can 

be expressed as a system of linear equations with correlated errors whose structure depends 
on G as follows. For any Ae Ki, 

A = /3ApaGiA) + e^ 

and for any other B e Ki, 

covariance{e^ , e^) - A^ ^. 

It is worth mentioning that the mapping above between probability distributions and sys- 
tems of linear equations is bijective. W e omit the p roof of this fact here, but it can be proven 



much in the same way as Lemma 1 in iPenal (120 lOf ). Note that each equation in the system 



of linear equations above is a univariate recursive regression, i.e. a random variable can be a 
regressor in an equation only if it has been t he reg ressand in a previous equation. This has 
two main advantages, as Cox and Wermuthl (1993, p. 207) explain: "First, and most impor- 
tantly, it describes a stepwise process by which the observations could have been generated 
and in this sense may prove the basis for developing potential causal explanations. Second, 
each parameter in the system [of linear equations] has a well-understood meaning since it is 
a regression coefficient: That is, it gives for unstandardized variables the amount by which 
the response is expected to change if the explanatory variable is increased by one unit and 
all other variables in the equation are kept constant." Therefore, a MAMP CG can be seen 
as a data generating process and, thus, it gives us insight into the system under study. 
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We close this section by introducing marginal-conditional AMP (MCAMP) CGs, which 
extend MAMP CGs with what we call convergent edges. We defer a thorough study of 
MCAMP CGs. Note that the edges in a MAMP CG can be spht into those that have no 
head and no tail (undirected edges), one head and one tail (directed edges), and two heads 
and no tail (bidirected edges). Therefore, one may say that there is a combination missing, 
namely edges with no head and two tails. We call these edges convergent and represent them 
as B. 

In a graph, a triplex ({A,C}, B) is an induced subgraph of the form A B ^C, A 
B-C,otA-B-^C. a graph G containing possibly directed, undirected, bidirected and 
convergent edges is a MCAMP CG if 

CI. G has no cycle Vi, . . . , V„ = Vi st Vi ^ V2 is in G and Vi <>->■ Vi+i, Vi - V^+i or Vi V^+i 

is in G for all 1 < i < n, 
C2. G has no cycle T^i , . . . , Ki = 14 st T^i •<->■ V2 is in G and Vi - V^+i or Vi V^+i is in G 

for all 1 < 2 < n, 

C3. if G has a subgraph Vi - V2 - V3 st V2 V4 is in G, then Vi and V3 are adjacent in G, 
and 

C4. if G has a triplex ({14, V3}, V2) si V2 ^V^ or V2 ^ . . . ^V^^V^ are in G, then V^ 
and V3 are adjacent in G. 

The semantics of MCAMP CGs is as follows. A node B in a path p in a MCAMP CG G 
is called a triplex node m p ii A ^ B C , A ^ B - C , or A - B <ro C is a, subpath of p. 
Moreover, p is said to be Z-open with Z ^V when 

• every triplex node i? in /? is in Z u sandZ) unless B D or B . . . ^ D E is 

in G, and 

• every non-triplex node B in p is outside Z unless A - B - C is a subpath of p and 
S ^ i:) is in G or S ^ i:» is in G with D i Z. 

Let X, Y and Z denote three disjoint subsets of V. When there is no Z-open path in G 
between a node in X and a node in y, we say that X is separated from Y given Z in G 
and denote it as X 1 gY\Z. Wc denote hj X J. g^\Z that X 1 gY\Z does not hold. The 
independence model induced by G is the set of separation statements X ]_g Y\Z. 

The motivation behind the four constraints in the definition of MCAMP CGs is as follows. 
The constraint CI generalizes the constraint CI of MAMP CGs. For the constraints C2- 
C4, note that typically every missing edge in a graphical model corresponds to a separation. 
However, this may not be true for graphs that do not satisfy these constraints. For instance, 
the graph G to the left below does not contain any edge between B and D but B J_gD\Z for 
all Z ^V \ {B,D}. Since this situation is counterintuitive, we enforce the constraint C3. 




Let now G be the graph to the right above. Note that G does not contain any edge between 
B and D but S/GD|Zforall Z ^V \ {B,D}. Since this situation is again counterintuitive, 
we enforce the constraint C4 on G. We have in principle five ways to do so, i.e. there are 
five ways to make B and D adjacent in G. However, in this case they reduce to the following 
three due to the constraint CI. 
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However, in each of the three graphs above, there is no edge between A and E but AIgE\Z 
for aX\ Z \ {A,E}. Since this is again counterintuitive, wc enforce the constraint C2. 

Given a MCAMP CG G, let G denote the AMP CG obtained by replacing every bidirected 
edge A B m G with A Lab ^ B and every convergent edge A B m G with 
A -> Sab B. Let S denote all the nodes of the form Sab in G. Then, note that X ]_gY\Z 
iff X 1 qY\Z u S for all X, y and Z disjoint subsets of V. Therefore, every MCAMP CG 
can be seen as the result of marginalizing out some nodes and conditioning on some others 
in an AMP CG, hence the name. We conjecture that all the results proven in this paper for 
MAMP CGs can be extended to MCAMP CGs. 
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